Assessing Text Readability Using Cognitively Based Indices
Abstract
Many programs designed to compute the readability of texts are narrowly based on surface-level linguistic features and take too little account of the processes which a reader brings to the text. This study is an exploratory examination of the use of Coh-Metrix, a computational tool that measures cohesion and text difficulty at various levels of language, discourse, and conceptual analysis. It is suggested that Coh-Metrix provides an improved means of measuring English text readability for second language (L2) readers, not least because three Coh-Metrix variables, one employing lexical coreferentiality, one measuring syntactic sentence similarity, and one measuring word frequency, have correlates in psycholinguistic theory. The current study draws on the validation exercise conducted by Greenfield (1999) with Japanese EFL students, which partially replicated Bormuth’s (1971) study with American students. It finds that Coh-Metrix, with its inclusion of the three variables, yields a more accurate prediction of reading difficulty than traditional readability measures. The finding indicates that linguistic variables related to cognitive reading processes contribute significantly more to readability prediction than the surface variables used in traditional formulas. Additionally, because these Coh-Metrix variables better reflect psycholinguistic factors in reading comprehension such as decoding, syntactic parsing, and meaning construction, the formula appears to be more soundly based and avoids criticism on the grounds of construct validity.
Related resources
Cognitively Motivated Features for Readability Assessment
We investigate linguistic features that correlate with the readability of texts for adults with intellectual disabilities (ID). Based on a corpus of texts (including some experimentally measured for comprehension by adults with ID), we analyze the significance of novel discourse-level features related to the cognitive factors underlying our users’ literacy challenges. We develop and evaluate a t...
Exploring the Relationship Between Modality and Readability Across Different Text Types
With regard to the relationship between the use of modality and the readability levels of texts, two opposing views have been raised. The first view endorses a direct positive relationship between modality and readability, in the sense that the use of modality increases textual understandability. The second view is that the use of modality leads to an increase in the number of words, resulting in readabilit...
Assessing Text Readability Using Hierarchical Lexical Relations Retrieved from WordNet
Although some traditional readability formulas have shown high predictive validity in the r = 0.8 range and above (Chall & Dale, 1995), they are generally not based on genuine linguistic processing factors, but on statistical correlations (Crossley et al., 2008). Improvement of readability assessment should focus on finding variables that truly represent the comprehensibility of text as well as...
Readability Indices for Automatic Evaluation of Text Simplification Systems: A Feasibility Study for Spanish
This paper addresses the problem of automatic evaluation of text simplification systems for Spanish. We test whether already-existing readability formulae would be suitable for this task. We adapt three existing readability indices (two measuring lexical complexity and one measuring syntactic complexity) to be computed automatically, which are then applied to a corpus of original news texts and...
What Can Readability Measures Really Tell Us About Text Complexity?
This study presents the results of an initial phase of a project seeking to convert texts into a more accessible form for people with autism spectrum disorders by means of text simplification technologies. Random samples of Simple Wikipedia articles are compared with texts from News, Health, and Fiction genres using four standard readability indices (Kincaid, Flesch, Fog and SMOG) and sixteen l...